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def 

Quantum relative entropy D(p\\a) = Trp(logp — logo - ) plays an important role in 
quantum information and related fields. However, there are many quantum analogues 
of relative entropy. In this paper, we characterize these analogues from information 
geometrical viewpoint. We also consider the naturalness of quantum relative entropy 
among these analogues. 
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1 Introduction 

In the quantum information theory, we usually focus on the quantum relative entropy D(p\\a) d = 
Trp(logp — log a) as a quantum analogue of relative entropy (divergence). However, there 
are many kinds of quantum analogues of relative entropy. Some of them have been discussed 
from the viewpoint of operator algebra [3 El- In the classical information geometry, the diver- 
gence can be defined by using the integral along the autoparallel curve. Since the geometrical 
approach in classical information systems is very attractive, excellent insights for quantum 
information system can be expected through the consideration from geometrical viewpoints. 

By extending this definition to the quantum system, Nagaoka defined quantum ana- 

logues of divergence based on the integral along the parallel translation, e-parallel translation 
and m-parallel translation are known as most popular parallel translations in the quantum 
system as well as in the classical system. These divergences are called e-path-divergence and 
e-path-divergence, respectively. In particular, In the classical system, the path-divergences of 
both translations give the usual relative entropy. On the other hand, Fisher information is 
unique in the classical system. However, it is not unique in the quantum system. PetzjH] com- 
pletely characterized its quantum analogues. As famous examples, SLD Fisher information, 
RLD Fisher information, and Bogoljubov Fisher information are known El El H El- Nagaoka 
showed that the quantum path-divergence concerning e (m)-parallel translation coincides with 
the quantum relative entropy D{p\\a) when the quantum Fisher information of interest is Bo- 
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2 Characterization of quantum analogues of relative entropy 

goljubov Fisher informational ITU] . He also calculated the quantum path-divergence with the 
SLD Fisher information concerning e-parallel translation^. 

In this paper, we calculate the quantum path-divergence other than the above cases. 
Then, we succeeded in relating information geometrical path-divergence and an operator- 
algebraic divergence D(p\\a) — Tr p\og(pz a^ 1 p? ), was introduced through operator-algebraic 
context by Belavkin and Staszewski Further, we proved the additivity of quantum 

path-divergence defined by e-parallel translation, and the monotonicity of quantum path- 
divergence defined by m-parallel translation. These two parallel translations are the dual 
parallel translations of each other. Since these two properties are fundamental, they are 
expected to be applied in the research field of quantum information. 

In the classical system, the divergence also can be defined from a convex function. Hence, 
divergence is closely related to convex analysis. Amari & Nagaoka 1 showed that only 
Bogoljubov Fisher information has zero-torsion. That is, the geometry of Bogoljubov inner 
product has the dual flat structure. They also proved the equivalence of the following two 
conditions. 1) The path-divergences of dual parallel translations can be given from potential 
function. 2) The dual parallel translation has the dual flat structure. Hence, in the quantum 
case, we can conclude that only path-divergences of Bogoljubov Fisher information is given by 
a potential function. This result indicates that the geometry of Bogoljubov Fisher information 
is closely related to optimization problem in quantum system. In fact, in their proof, the 
calculations concerning Christoffel symbols were essentially used. However, many quantum 
information scientists are not familiar to such analysis. In this paper, we give another proof 
of this argument without any use of Christoffel symbols. This paper can be expected to be a 
good guidance for quantum information geometry for quantum information scientist. 

This paper is organized as follows. In section we review the information geometri- 
cal characterization of divergence D{jp\\q) in the classical system, we also review how the 
divergence can be defined by the convex function in the classical system. In section we 
give a review of inner product in quantum systems, which is a fundamental of quantum in- 
formation geometry. In section two kinds of autoparallel translations and autoparallel 
curves are reviewed. In sectional we treat quantum analogues of relative entropy from the 
operator- algebraic viewpoint. In sectional we examine quantum path-divergences based on 
e-autoparallel translation, and consider their properties. In section we examine quantum 
path-divergences based on m-autoparallel translation, and consider their properties. In par- 
ticular, the relation between an operator- algebraic divergence and quantum path-divergences 
based on e (m)-autoparallel translation are derived in section |^| (|7|l . 

2 Divergence in Classical Systems 

First, we review the information geometrical characterization of divergence D{p\\q) in the 
classical system £Q. Let p(uj) be a probability distribution, and X(u>) be a random variable. 
When the family {pe\8 S 8} has the form 



Pb{lo) = p{co)e' 



,ex{u))-n{6) 



(1) 
(2) 
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the logarithmic derivative at respective points equals the logarithmic derivative at a fixed 
point with the addition of a constant. In this case, the family is called an exponential family, 
and [i(6) is called the moment function of X. In particular, since the logarithmic derivative 
is closely related to exponential families, it is often called the exponential (e) representation 
of the derivative. Therefore, we use the superscript (e) in the inner product ( , The 
function /i(9) is often called a potential function in the context of information geometry. 
Since the second derivative n"{0) is the Fisher information Jg > 0, the moment function 
n{9) is a convex function. Therefore, the first derivative //(#) = P6{w)X(lo) is monotone 
increasing. That is, we may regard it as another parameter identifying the distribution p$, 
and denote it by r\. The original parameter 6 is called a natural parameter and the other 
parameter rj is an expectation parameter. For example, in the binomial distribution, the 
parameterization pe(0) = 1/(1 + e e ), pe{l) = e e /(l + e e ) is the natural parameter, and the 
parameterization p v (0) = t], ^(1) = 1 — r? is the expectation parameter. Hence, the binomial 
distribution is an exponential family. 

Further, let Xi(u>), . . . , Xk(oS) be k random variables. We can define a /c-parameter expo- 
nential family 

M<=iog]>>M e ^ eVQ K (3) 

UJ 

The parameters 9 l are natural parameters, and the other parameters r/i = = J2uj P& ( w )-^'t( w ) 
are expectation parameters. Since the second derivative ggjggi is equal to the Fisher Infor- 
mation matrix Je-.ij, the moment function fj,(9) is a convex function. 

Let be a twice-diffcrentiable and strictly convex function defined on a subset of the d- 
dimcnsional real vector space M. d . The divergence concerning the convex function [i is defined 

by 

d»(§\\6) d ^ f ]T Vi(0W - P) n(§) + n(0), 

i 

*W = J£W- (4) 
This quantity has the following two characterizations: 

D»{9\\9) = max ^ (6)0* - 9*) - ^6) + p(0) 

i,j 

In the one-parameter case, we obtain 

D»(9\\9) = »'(&)(§ -9) - n(8) + n(6) 

r s _ _ 

= maxfj,'(0)(6-6)-n(6)+n(0)= n" {0)(0 - O)d0 . (6) 
e Je 
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Since the function /i is strictly convex, the correspondence 9 l «-> rji — J^j is one-to-one. 
Hence, the divergence D^(9\\9) can be expressed with the parameter rj. For this purpose, we 
define the Lcgcndrc transform v of fi 

Kr?) d = maxy># - (7) 
Then, the function ^ is a convex function, and we can recover the function [i and 9 as 

fx(d) = max V Stf - v(H), 0* = - A 
v t-r 1 or}i 

The second derivative matrix g " ^ of ^ is equal to the inverse of the matrix 

90 



In particular, when rji — §fjr(9), 



v(v) = E ^ - = D ^ e W°) - A*(°). (8) 

= E^ - <n) = D"(v\\o) - "(Q). (9) 

Using this relation, we can characterize the divergence concerning the convex function /j, by 
the divergence concerning the convex function v as 

d»%\q) = D v ( v \\fj) = J2 - ^) - "fa) + (10) 

i 

Now, we apply the discussion about the divergence to a multi-parametric exponential 
family {p g \6 E R} defined in © UJ. Then, 

D(pg\\p 8 ) = D»(0\\9) = J2 - ° l ) - + 

In particular, applying JSJ to a one-parameter exponential family JIJ, we have 

= [ J § (9-e)d6 = max(e-9)(r](e) + e)-n(6) + n(9). (11) 

In the following, we consider the case where p is the uniform distribution p m i x . Let the 
random variables Xi(cu), . . . , Xj~(uj) be a CONS of the space of random variables with expec- 
tation under the uniform distribution p m i x , and Y l (ui), . . . , Y k (ui) be its dual basis satisfying 
^2 UJ Y l (uj)Xj(uj) — Sj. Then, any distribution can be parameterized by the expectation pa- 
rameter as 

Pt,{6){u) =Pmix(u) + ^2?]i(6)Y 1 (uj). 

i 

From (JTOJ and 10), 

dv 

d (pv\\pv) = D "(vh) = -E-iVi - m) - v(v) + 

v{rj) = D(p v \\p mbc ) = -H(jp v ) + H(p mix ) 
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because p(0) — 0. The second derivative matrix of v is the inverse of the second derivative 
matrix of p, i.e., the Fisher information matrix concerning the natural parameter 9. That 
is, the second derivative matrix of v coincides with the Fisher information matrix concerning 
the expectation parameter r\. Hence, applying © to the subspace {(1 — t)p + tq\0 < t < 1}, 
we have 

D(p\\q)= f Jttdt, (12) 
Jo 

where Jt is the Fisher information concerning the parameter t. 
3 Inner Products in Quantum Systems 

In this section, in order to define the quantum analogues of divergence, we define as inner 
products in quantum systems. There are at least three possible ways of defining the product 
corresponding to Xp: 

def T , def 1 

2 



E p , s {X)=Xop^-{pX + Xp), (13) 



£ p , b (X) d = / p x Xp l - x d\, 
Jo 



E p , r (X) d =pX. (14) 
Here, X is not necessarily Hermitian. These extensions are unified in the general form [S] 



def 



E PtP (X) = / E PtX (X)p(dX), (15) 
Jo 

£ P ,ApO d = p X Xp l ~ x , (16) 



where p is an arbitrary probability distribution on [0, 1]. The case (|13|l corresponds to the 
case lfL5|) with p(l) = p(0) = 1/2, and the case (|T4"}) does to the case l(T5)l with p(l) = 1. 
In particular, the map E PjX is symmetric, when E PtX (X) is Hermitian if and only if X is 
Hermitian. Hence, when the distribution p is symmetric, i.e., p(x) = p(l — x), the map E p ^ p 
is symmetric. When p > 0, these maps possess inverses. 
Accordingly, we may define these types of inner products 

(Y,X) p % d ^TrY*E p , x (X) x = s,b,r,X,p. 

If X, Y, p all commute, these have the same value. These are called the SLD, Bogoljubovf 
RLD, A, and p inner products [3 |H1 El E E] 7 respectively (reasons for this will be given in the 
next section). These inner products are positive semi-definite and Hermitian, i.e., 

(||x||^)) 2 d ^ f {x,x)M > o, (y,x) p % = ((x,y)W)*. 

A dual inner product may be defined (A,B) p m J = Tr(E~l.(A))* B with respect to the corre- 
spondence A — E PtX (X). Denote the norm of these inner products as (jl^lllp™' 1 ^ = f (A, A) p m J. 

"The Bogoljubov inner product is also called the canonical correlation in statistical mechanics. In linear 
response theory, it is often used to give an approximate correlation between two different physical quantities. 
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Hence, the inner product (A,B) ™J is positive semi-definite and Hermitian. Using this inner 
product, we define quantum analogues of Fisher information as 



for a one-parameter family {pg} and x = s,r, b, A, p. 
4 Autoparallel Curves in Quantum Systems 

Next, we define parallel transport and autoparallel curves in quantum systems according 
to Nagaoka J2J and Amari & Nagaoka 1 . To introduce the concept of a parallel transport, 
consider an infinitesimal displacement in a one-parameter quantum state family {pg\9 € K}. 
The difference between pg +e and pg approximately equals to ^-(0)e. Hence, the state pg+ e 
can be regarded as the state transported from the state pg in the direction -^§§-{9) by an 
amount e. However, if the state pg+ e coincides precisely with the state displaced from the 
state pg by e in the direction of the infinitesimal displacement at the intermediate 

states pg+ e i (0 < e' < e) must equal the infinitesimal displacement -^§§-(9) A at 9. Then, the 
problem is to ascertain which infinitesimal displacement at the point 9 + e' corresponds to 
the given infinitesimal displacement -^#(0)A at the initial point 9. The rule for matching 
the infinitesimal displacement at one point to the infinitesimal displacement at another point 
is called parallel transport. The coefficient -^§-{9) of the infinitesimal displacement at 9 is 
called the tangent vector, as it represents the slope of the tangent line of the state family 
{pg\9 £ R} at 9. Therefore, we can consider the parallel transport of a tangent vector instead 
of the parallel transport of an infinitesimal displacement. 

Commonly used parallel transports can be classified into those based on the m representa- 
tion (m parallel translation) and those based on the e representation (e parallel translation) . 
The m parallel translation Hpfy> e i moves the tangent vector at one point pg to the tangent 
vector with the same m representation at another point pg>. On the other hand, the e parallel 

(e) 

translation ILx,p e ,p g , moves the tangent vector at one point pg with the e representation L to 
the tangent vector at another point pg> with the e representation L — Trpg/LpQ. Of course, 
this definition requires the coincidence between the set of e representations at the point 9 
and that at another point 9' . Hence, this type of e parallel translation is defined only for the 
symmetric inner product (X,Y) P ^ X , and its definition depends on the choice of the metric. 
Indeed, the e parallel translation can be regarded as the dual parallel translation of the m 
parallel translation concerning the metric (X, Y) p e ^ x in the following sense: 



where X is the e representation of a tangent vector at p$> and A is the m representation of 
another tangent vector at pg. 

Further, a one-parameter quantum state family is called a geodesic or an autoparallel curve 
when the tangent vector (i.e. the derivative) at each point is given as a parallel transport 
of a tangent vector at a fixed point. Especially, the e geodesic is called a one-parameter 
exponential family. 

For example, in an e geodesic with respect to SLD {pg\9 e R}, any state pg coincides 
with the state transported from the state po along the autoparallel curve in the direction 
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L by an amount 9, where L denotes the SLD e representation of the derivative at po- We 
shall henceforth denote the state as H e L s pq. Similarly, H S L b po denotes the state transported 
autoparallely with respect to the Bogoljubov e representation from po in the direction L by 
an amount 9. 

When the given metric is not symmetric, the e parallel translation moves the tangent 
vector at one point 9 under the e representation L to the tangent vector at another point 
6' with the e representation L' - Tr pe>L' with the condition L + L* = L' + (£')*■ That is, 
we require the same Hermitian part in the e representation. Hence, the e parallel translation 
nf^po.p^, coincides with the e parallel translation pg / with regard to its symmetrized 
inner product. Therefore, we can define the state transported from the state po along the 
autoparallel curve in the direction with the Hermitian part L by an amount 9 with respect 
to RLD (A, p), and denote them by U e L r p (H 6 L x po, po), respectively. However, only the 
SLD one-parameter exponential family {H^ s po\s G M} plays an important role in quantum 
estimation examined in the next section. 

Lemma 1 Tl e L a, n? h a , U e L a and H e r k a may be written in the following formal 

^^e-'-Weiy 1 , (17) 

U{ iT a = e-^ 6 ^e 6L ^, (19) 
n e T ^^e^^e^^^e^^, (20) 

Ij, 2 

where we choose Hermitian matrices L r and Li as L = j{u^^L r a^ + a^L r a~i) and L — 
T;(<T~i Licri + Lia^i), respectively, and 

Ms (0) d ^logTret L <7e^ 

p b {9) d = logTre lo s CT + eL , (21) 

^(^^logTr^e^-V^, 
p 1 / 2 {9) = logTrcr*e ?a 2 e ju*. 

Proof. Taking the derivative of the RHS of and JTSJl, we see that the SLD (or 
Bogoljubov) e representation of the derivative at each point is equal to the parallel transported 
e representation of the derivative L at a. In the RHS of JT5J, the RLD e representation of the 
derivative at each point is equal to the parallel transported e representation of the derivative 
\fa 1 L r ^fa at a. Further, In the RHS Q20JI. the | e representation of the derivative at each 
point is equal to the parallel transported e representation of the derivative L r at a. 

Conversely, from the definition of x a, we have 

dIl e L x a l 
— — = E P9tX (L-TrLpg), x = s,r,-. 

Since this has only one variable, this is actually an ordinary differential equation. From 
the uniqueness of the solution of an ordinary differential equation, the only H S L x a satisfying 
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H° L x a — a is the one given above. Since any e representation a has the form ^fa 1 L^fa with 
a Hermitian matrix L, we only discuss pe — T\® , ir _ a. Taking: its derivative, we have 

Vct Ly/a,r ° 

^ CT ' r = Pe(V& ^v^-Trpev 7 ^ Lv^)- 

Similarly, from the uniqueness of the solution of an ordinary differential equation, only the 
state family l|19|) satisfies this condition. □ 

5 Non-Geometrical Characterization of Divergences in Quantum Systems 

First, we briefly characterize quantum analogues of divergence from the non-geometrical view- 
point. A quantity D(p||cr) can be regarded as a quantum version of divergence if any commu- 
tative states p and a satisfy 

D(p\\a) = D(p\\p), (22) 

where p and p is the probability distribution consisting of the eigenvalues of p and a. If a 
relative entropy D(p\\a) satisfies the monotonicity for a POVM M = {Mi}: 

D(p\\a) > D(Pf ||Pf ), Pf (i) d ^ Tr pMi (23) 

and the additivity 

D(pi ® P2hi <8> cr 2 ) = D(pi\Wi) + D(p 2 \\a 2 ), (24) 

then Hiai & Petz |S]'s result yields the relation 

D(o® n \\a® n ) D(P™\\P™) 
D{p\\a) = lim [P lla > > hmsup V p — °— = D(p\\a). (25) 
n jvt ti 

That is, the quantum relative entropy D(p\\a) is the minimum quantum analogue of relative 
entropy with the monotonicity for measurement and the additivity. 
Further, Hiai & Petz |S] showed the inequality 

D(p\\a) < D(p\\a) d = Tr plog(pi a^p^). (26) 

6 Quantum Path-divergences Based on e-Parallel Translation 

Now, using the concept of the exponential family, we extend the path-divergence based on 
the first equation in (|f f (I . For any two states p and a, we choose the Hermitian matrix L such 
that the exponential family {n^ x ^}ee[o,i] concerning the inner product Jg >x satisfies 

ni,^ = p. (27) 

Then, wc define the x-e-divergence as follows: 

D x e) { P \W) = / Je, x 0d6, (28) 
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where Jg tX is the Fisher information concerning the exponential family Tl L x a ■ Since n ilg;+;8i2 j , 
cr 2 ) equals (H^ x ai) <g> (n^ 2 ^02), 



D x e \pi® 



P2W1 <X> cr 2 



) = J D( e )(p 1 ||a 1 )+^ e )(p 2 ||a 2 ), 



(29) 



i.e., the e-divergence satisfies the additivity for any inner product. 
Theorem 1 When 



1 , 1 1.1 



L = < 



2 log a 2 (a 2 pa 2 )2 a 2 
logp-logCT 

i [<7 _ 2 l0g((T~ 2^(7^2 )tT2 

1 , _ 1 _ J_ , _ In 

+ CT2 fog(CT 2^ 2 Jcr 2 I 

_I, / _I I -In I 

a * log(er 4 p2 cr 4 jo-4 

I, / _ A I _ A \ _ 
+U* l0g((T 4/320- 4 Jcr 



/or a; = s 
for x = b 
for x = r 



for x 



the condition \2T\j holds. Hence, we obto 



1 . 1 1,1 



Di e) (p\W) = 2Trplogcr"2( cr t /9cr i)f cr 
Di e) (p\\a) = TrpQogp- log a) = D(p\\a) 
Di e \p\\a) = Trplog(pi<7-V') - #(pH 

-Di^(p||cr) = 2Tr((7'p'lT')((T _ 'p^(T _ ') log(c7~'p'(7~'). 



(30) 



(31) 
(32) 
(33) 
(34) 



Nagaoka 2 obtained the above results for x = s, b. 

Proof. When we substitute (|30[1 into L, condition (|27|l can be checked by using Lemma 
n In this case, L r = log(er~ 2 p<j~2 ) ; Li — 2log(<j~4p2<T~4), and we can show that 



d 2 p x (9) 
d9 2 



Hence, from a discussion similar to we can prove that 



^i e) (plk) = 



dp x {9) 



d6 



(l-0)-Mx(l) + Mx(0) = 



dp x (9) 



de 



(35) 



(36) 



where /^(fl) is defined in Theorem 2] Using this relation, we can check (|31f) . (|32|) . and l|34|) . 
Concerning (|33|) . we obtain 

-D^(/9||cr) = Tr <70-~i/9(7~2 log(cr _ ^pCT - ^) = Trplog^^cr-ip^)^ 

where the last equation follows from the equation with AU = er~^ p^ (A is Herimitain and U 
is unitary): 

AUU* A \og(AUU* A) = AU \og(U* AAU)U* A. 



□ 

Now, we compare these quantum analogues of relative entropy given in (|31|I - H34|I . As is eas- 
ily checked, these satisfy the condition ill' 21) for quantum analogues of relative entropy. Let M 
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be a measurement corresponding to the spectral decomposition of a 1 / 2 (<j 1 f 2 pa 1 / 2 ) 1 /' 2 a 1//2 . 
This PVM M satisfies that D i s e \p\\a) = D(P^ ||P^). Thus, from the monotonicity for 
measurement concerning the quantum relative entropy D(p\\a), 

D(p\\a) > Dl e) (p\\a) = 2Trplogo-~5( -§ p(7 2)3 --5. (37) 

From (|2)J), 

D(p\\a) < D^{p\\a) = Tr plog(pi a' 1 pi). (38) 

Hence, from the inequality (|25|l and the additivity i|29[l . Ds(p\\a) and D r e \p\\a) do not 
satisfy the monotonicity even for measurements because the equality in l|37l) and (|38|l does 
not always hold. 

7 Quantum Path-divergences Based on m-Parallel Translation 

Further, we can extend the path-divergence based on the equation Q12JI. For any two states p 
and cr, the family {(1 — t)p + tcr\0 < t < 1} is the m geodesic joining p and a. Hence, as an 
extension of (|12J) . we can define the x-m divergence as 

D x m) {p\W) = f Jt.Jdt. (39) 
Jo 

Since the family {(1 — t)n{p) + tn(cr)\0 < t < 1} is the m geodesic joining n(p) and k(ct) for 
any TP-CP map k, we have 

D^\p\W)>D^\<p)Ma)l (40) 

i.e., the m divergence satisfies the monotonicity. Since the RLD is the largest inner product, 

Di m Hp\\a)>Di m Hp\\a). (41) 

We can calculate the m divergence as 

Di m) (p\W) = Trp(logp-lo g( T) = D(p\\a) (42) 
Di m Hp\\a) = TTp\og(^a- 1 ^) = D(p\\<j). (43) 

In fact, The Bogoljubov case l|42|l has been obtained by Nagaoka ^UJi ancl follows from 
Theorem Hence, Tr p\og{^/pa^ 1 ^fp) — Di m \p\\a) satisfies the monotonicity for TP-CP 
maps. Also, from (|4*T|> . we obtain Tr p\og( y /pa~ 1 ^/p) > D(p\\a)^. 

Further, all of x-m divergences do not necessarily satisfy the additivity (|29J) . At least, when 
the inner product J x _g is smaller than the Bogoljubov inner product Jb,e, i.e., Je.x < Je,b, we 
have D(p\\a) > D x m \p\\a) . From (|25|l and the monotonicity (|40|l . D x m \p\\a) does not satisfy 
the additivity For example, SLD m divergence does not satisfy the additivity 

We can now verify whether it is possible in two-parameter state families to have states 
that are e autoparallel transported in the direction of L\ by 6 1 , and in the direction L2 by 2 . 
In order to define such a state, we require that the state that is e autoparallel transported first 
in the L\ direction by 6 1 from poj then further e autoparallel transported in the L2 direction 
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by 9 2 coincides with the state that is e autoparallel transported in the L2 direction by 9 2 from 
PO; then e autoparallel transported in the L\ direction by 6 . That is, if such a state would 
be defined, the relation 

nf 2lX ni 1 1 ,x^ = ni 1 1 , x n£,^ (44) 

should hold. Concerning this condition, we have the following theorem. 

Theorem 2 The following conditions for the inner product Jg -X are equivalent 

© Jg tX is the Bogoljubov inner product, i.e., x = b. 

© The condition Um holds for any two Hermitian matrices L\ and L2 and any state po. 
© D x e) (pg\\pg) = D^(9\\9). 
® D x e \p\\a) = D{p\\a). 

© D^iprM^D^rM- 
© D x rn) {p\\a)=D{p\\a). 
Here, the convex functions p{8), v{rj) and the states pg, p v are defined by 

p/= f cxp(^^-/i(e)), 

i 

/i^^logTrexp^^XO, (45) 

i 

3 

i/fa) A ^D x m \p a \\p^) = -H( Pv ) + H(p abc ), 

where Xi, . . . , Xk is a basis of the set of traceless Hermitian matrices, and Y 1 , . . . , Y k is its 
dual basis. 

This theorem implies that only the quantum path-divergence based on the Bogoljubov Fisher 
information can be characterized by the convex function among quantum path-divergence 
based on m-parallel translation. 

Proof. First, we prove that ©=>©. Theorem Q] guarantees that Bogoljubov e autoparallel 
transport satisfies 

where p b {9) d = logTre log '>+ elLl+e2 - L2 . Hence, we obtain ©. 

Next, we prove that @^@. We define pg d = rFj^, • • • ,H e ^ ib p mix for 6 = (6 1 , . . .,9 k ). 
Then, the condition © guarantees that pg — Tl}^ (gi-gi)x- xP~ e ~ ^ n P ar ticular, when 9 = 0, 
we obtain pg = g ix x p m i x . Since 9 l X t is commutative with p m ; x , we can apply the 
classical observation to this case. Hence, the state pg coincides with the state pg defined in 

(ESJ. 



12 Characterization of quantum analogues of relative entropy 



Let Xj,g be the x-e representation of the partial derivative concerning 9 3 at pg. It can be 
expressed as 

Xj.g — Xj — Tr pgXj + Xgj, 
where Xgj is the skew-Hermitian part. Thus, 

dTl ' PeX > = Tr ( ^X 3 ) = Tr ( ^(X, - Tr^X,) 

= ReTr (jjjj[(Xj ~ Tr pg Xj + Xg^ = ReJe, x; i,j. 

Note that the trace of the product of a Hermitian matrix and a skew-Hermitian matrix is an 
imaginary number. Since Re Jg t x;i,j = R^J$,x;j,i: w ^ have ^ T ggf X ° = 9 T gg f X ' ■ Thus, there 
exists a function such that p,(0) = /z(0) and 

W) rp y- 

This function £t satisfies condition (D. 

Moreover, since Trp m i x Xi = 0, from the definition |@J, we have p{9) — p(0) = £>^(O||0). 
Since the state p mix commutes the state pg, the relation (p m ix||pe) = At(0) — A*(0) holds. 
Hence, we obtain p{9) = p,{9). 

Further, we have D^{9\\9) = D(p\\9). Thus, the equivalence between © and © is trivial 
since the limit of D(pg\\pg) equals the Bogoljubov inner product Jb.g- Hence, we obtain ©=>©. 

Now, we proceed to the proof of ©+©+©+©=>©. In this case, the function iy(rj) coin- 
cides with the Legendre transform of p(9), and rji = §jfr(9). Hence, D u (rj\\fj) = Z? M (#||#) = 
D(p fl \\p n ). The second derivative matrix j coincides with the inverse of the second 

derivative matrix ggiggj , which equals the Bogoljubov Fisher information matrix concerning 
the parameter 9. Since the Bogoljubov Fisher information matrix concerning the parameter 
rj equals the inverse of the Bogoljubov Fisher information matrix concerning the parameter 
9, the Bogoljubov Fisher information matrix concerning the parameter rj coincides with the 
second derivative matrix g SjLj • Hence, from (JSJ), we have D v (r)\\fj) = (pg\\pg). 

Next, we prove ©=>©. Since p m i X = po commutes with p v , the m divergence D% (pa\\p v ) 
coincides with the Bogoljubov m divergence D^^p^Wp^), which equals the Legendre trans- 
form of p{9) defined in lfl5|) . Thus, D x m \p i )\\p ri ) = D"(i]\\ij) — D(p n \\p ri ). Finally, taking the 
limit f\ — > r\, we obtain J x „ — Jb lV , i.e., ©=>©. □ 

8 Concluding Remark 

In this paper, we proved the additivity of e-divergences and the monotonicity of m-divergences. 
We also found interesting relations between geometrical path-divergences and an operator- 
algebraic divergence as 

DM(p\\*)=Dl™\p\\a)=D(p\\a). 

In addition, we obtained the characterization of Bogoljubov inner product as Theorem [21 
which is a generalization of Amari & Nagaoka p^'s characterization. It is expected that these 
characterizations are applied to quantum information. 
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